Dataset statistics
| Statistic | Value |
|---|---|
| Number of variables | 20 |
| Number of observations | 186123 |
| Missing cells | 108933 |
| Missing cells (%) | 2.9% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 19.7 MiB |
| Average record size in memory | 111.0 B |
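The two derived figures in this table can be reproduced from the raw counts. The following is a minimal sketch, assuming that "Missing cells (%)" is the missing-cell count divided by rows × columns and that 1 MiB means 2^20 bytes; the variable names are illustrative and not part of the report.

```python
# Figures copied from the Dataset statistics table.
n_variables = 20
n_observations = 186_123
missing_cells = 108_933
total_size_mib = 19.7

# Assumption: the percentage is taken over all cells (rows x columns).
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells

# Assumption: 1 MiB = 2**20 bytes.
avg_record_bytes = total_size_mib * 2**20 / n_observations

print(f"missing cells: {missing_pct:.1f}%")
print(f"average record size: {avg_record_bytes:.1f} B")
```

Under these assumptions both values round to the figures reported above.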
Variable types
| Type | Count |
|---|---|
| Categorical | 4 |
| Text | 3 |
| Numeric | 9 |
| Boolean | 4 |
Alerts
| Alert | Type |
|---|---|
| firing_type has a high cardinality: 107 distinct values | High cardinality |
| firing_type is highly imbalanced (66.4%) | Imbalance |
| flat_type has 25049 (13.5%) missing values | Missing |
| telekom_uploadspeed has 22095 (11.9%) missing values | Missing |
| firing_type has 35948 (19.3%) missing values | Missing |
| heating_type has 25841 (13.9%) missing values | Missing |
| immoscout_id has unique values | Unique |
| floor has 20974 (11.3%) zeros | Zeros |
| service_charge has 2265 (1.2%) zeros | Zeros |
Reproduction
| Item | Value |
|---|---|
| Analysis started | 2024-04-15 20:21:42.498209 |
| Analysis finished | 2024-04-15 20:21:56.255641 |
| Duration | 13.76 seconds |
| Software version | ydata-profiling v4.7.0 |
| Download configuration | config.json |
bundesland
Categorical
| Distinct | 16 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.8 MiB |
Length
| Max length | 22 |
|---|---|
| Median length | 18 |
| Mean length | 12.091708 |
| Min length | 6 |
Characters and Unicode
| Total characters | 2250545 |
|---|---|
| Distinct characters | 36 |
| Distinct categories | 4 |
| Distinct scripts | 2 |
| Distinct blocks | 2 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Nordrhein_Westfalen |
|---|---|
| 2nd row | Sachsen |
| 3rd row | Bremen |
| 4th row | Sachsen |
| 5th row | Baden_Württemberg |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| Sachsen | 43498 | 23.4% |
| Nordrhein_Westfalen | 42173 | 22.7% |
| Sachsen_Anhalt | 15816 | 8.5% |
| Bayern | 14492 | 7.8% |
| Hessen | 11277 | 6.1% |
| Baden_Württemberg | 10019 | 5.4% |
| Niedersachsen | 9122 | 4.9% |
| Berlin | 8638 | 4.6% |
| Thüringen | 5987 | 3.2% |
| Brandenburg | 5578 | 3.0% |
| Other values (6) | 19523 | 10.5% |
Words
| Value | Count | Frequency (%) |
|---|---|---|
| sachsen | 43498 | 23.4% |
| nordrhein_westfalen | 42173 | 22.7% |
| sachsen_anhalt | 15816 | 8.5% |
| bayern | 14492 | 7.8% |
| hessen | 11277 | 6.1% |
| baden_württemberg | 10019 | 5.4% |
| niedersachsen | 9122 | 4.9% |
| berlin | 8638 | 4.6% |
| thüringen | 5987 | 3.2% |
| brandenburg | 5578 | 3.0% |
| Other values (6) | 19523 | 10.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 333346 | |
| n | 262916 | 11.7% |
| r | 174158 | 7.7% |
| a | 170619 | 7.6% |
| s | 151041 | 6.7% |
| h | 141574 | 6.3% |
| l | 90621 | 4.0% |
| t | 82405 | 3.7% |
| _ | 82179 | 3.7% |
| i | 79460 | 3.5% |
| Other values (26) | 682226 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1868052 | |
| Uppercase Letter | 284308 | 12.6% |
| Connector Punctuation | 82179 | 3.7% |
| Other Number | 16006 | 0.7% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 333346 | |
| n | 262916 | |
| r | 174158 | |
| a | 170619 | |
| s | 151041 | |
| h | 141574 | |
| l | 90621 | 4.9% |
| t | 82405 | 4.4% |
| i | 79460 | 4.3% |
| c | 77823 | 4.2% |
| Other values (12) | 304089 |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 64353 | |
| W | 52192 | |
| N | 51295 | |
| B | 40864 | |
| H | 18209 | 6.4% |
| Ã | 16006 | 5.6% |
| A | 15816 | 5.6% |
| T | 5987 | 2.1% |
| M | 5009 | 1.8% |
| V | 5009 | 1.8% |
| Other values (2) | 9568 | 3.4% |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 82179 |
Other Number
| Value | Count | Frequency (%) |
| ¼ | 16006 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2152360 | |
| Common | 98185 | 4.4% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 333346 | |
| n | 262916 | |
| r | 174158 | 8.1% |
| a | 170619 | 7.9% |
| s | 151041 | 7.0% |
| h | 141574 | 6.6% |
| l | 90621 | 4.2% |
| t | 82405 | 3.8% |
| i | 79460 | 3.7% |
| c | 77823 | 3.6% |
| Other values (24) | 588397 |
Common
| Value | Count | Frequency (%) |
| _ | 82179 | |
| ¼ | 16006 | 16.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2218533 | |
| None | 32012 | 1.4% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 333346 | |
| n | 262916 | |
| r | 174158 | 7.9% |
| a | 170619 | 7.7% |
| s | 151041 | 6.8% |
| h | 141574 | 6.4% |
| l | 90621 | 4.1% |
| t | 82405 | 3.7% |
| _ | 82179 | 3.7% |
| i | 79460 | 3.6% |
| Other values (24) | 650214 |
None
| Value | Count | Frequency (%) |
| Ã | 16006 | |
| ¼ | 16006 |
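The character tables above list Ã and ¼ as separate characters, each occurring 16006 times, which equals the combined count of Baden_Württemberg (10019) and Thüringen (5987). That pattern is typical of UTF-8 text that was decoded as Windows-1252, so that "ü" becomes "Ã¼". The report itself does not state the cause, so the following is only a sketch under that assumption; `fix_mojibake` is a hypothetical helper name.

```python
def fix_mojibake(text: str) -> str:
    """Undo UTF-8 bytes that were wrongly decoded as Windows-1252.

    Returns the input unchanged when it does not round-trip cleanly,
    for example when the text was already decoded correctly.
    """
    try:
        return text.encode("cp1252").decode("utf-8")
    except (UnicodeEncodeError, UnicodeDecodeError):
        return text


print(fix_mojibake("Baden_WÃ¼rttemberg"))
print(fix_mojibake("ThÃ¼ringen"))
print(fix_mojibake("Sachsen"))
```

Returning the input on failure keeps values that are already correct untouched, which matters when only part of a column is affected.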
city
Text
| Distinct | 419 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.8 MiB |
Length
| Max length | 39 |
|---|---|
| Median length | 28 |
| Mean length | 11.713727 |
| Min length | 3 |
Characters and Unicode
| Total characters | 2180194 |
|---|---|
| Distinct characters | 54 |
| Distinct categories | 6 |
| Distinct scripts | 2 |
| Distinct blocks | 2 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Dortmund |
|---|---|
| 2nd row | Dresden |
| 3rd row | Bremen |
| 4th row | Mittelsachsen_Kreis |
| 5th row | Emmendingen_Kreis |
Words
| Value | Count | Frequency (%) |
|---|---|---|
| leipzig | 10808 | 5.8% |
| chemnitz | 9335 | 5.0% |
| berlin | 8638 | 4.6% |
| dresden | 5373 | 2.9% |
| magdeburg | 4212 | 2.3% |
| halle_saale | 3585 | 1.9% |
| münchen | 3344 | 1.8% |
| essen | 2993 | 1.6% |
| frankfurt_am_main | 2889 | 1.6% |
| düsseldorf | 2792 | 1.5% |
| Other values (409) | 132154 | 71.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 287831 | 13.2% |
| r | 200632 | 9.2% |
| i | 185861 | 8.5% |
| s | 150386 | 6.9% |
| n | 142913 | 6.6% |
| a | 109890 | 5.0% |
| _ | 108756 | 5.0% |
| K | 74921 | 3.4% |
| l | 74013 | 3.4% |
| g | 66504 | 3.1% |
| Other values (44) | 778487 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1724090 | |
| Uppercase Letter | 321952 | 14.8% |
| Connector Punctuation | 108756 | 5.0% |
| Other Number | 14395 | 0.7% |
| Other Punctuation | 7411 | 0.3% |
| Currency Symbol | 3590 | 0.2% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| K | 74921 | |
| Ã | 29673 | 9.2% |
| M | 25188 | 7.8% |
| B | 23096 | 7.2% |
| S | 20129 | 6.3% |
| L | 18570 | 5.8% |
| H | 17129 | 5.3% |
| D | 16953 | 5.3% |
| R | 12794 | 4.0% |
| C | 11060 | 3.4% |
| Other values (15) | 72439 |
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 287831 | |
| r | 200632 | |
| i | 185861 | |
| s | 150386 | 8.7% |
| n | 142913 | 8.3% |
| a | 109890 | 6.4% |
| l | 74013 | 4.3% |
| g | 66504 | 3.9% |
| t | 64357 | 3.7% |
| u | 63822 | 3.7% |
| Other values (14) | 377881 |
Other Punctuation
| Value | Count | Frequency (%) |
| ¶ | 7337 | |
| . | 74 | 1.0% |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 108756 |
Other Number
| Value | Count | Frequency (%) |
| ¼ | 14395 |
Currency Symbol
| Value | Count | Frequency (%) |
| ¤ | 3590 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2046042 | |
| Common | 134152 | 6.2% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 287831 | |
| r | 200632 | 9.8% |
| i | 185861 | 9.1% |
| s | 150386 | 7.4% |
| n | 142913 | 7.0% |
| a | 109890 | 5.4% |
| K | 74921 | 3.7% |
| l | 74013 | 3.6% |
| g | 66504 | 3.3% |
| t | 64357 | 3.1% |
| Other values (39) | 688734 |
Common
| Value | Count | Frequency (%) |
| _ | 108756 | |
| ¼ | 14395 | 10.7% |
| ¶ | 7337 | 5.5% |
| ¤ | 3590 | 2.7% |
| . | 74 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2120848 | |
| None | 59346 | 2.7% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 287831 | |
| r | 200632 | 9.5% |
| i | 185861 | 8.8% |
| s | 150386 | 7.1% |
| n | 142913 | 6.7% |
| a | 109890 | 5.2% |
| _ | 108756 | 5.1% |
| K | 74921 | 3.5% |
| l | 74013 | 3.5% |
| g | 66504 | 3.1% |
| Other values (39) | 719141 |
None
| Value | Count | Frequency (%) |
| Ã | 29673 | |
| ¼ | 14395 | |
| ¶ | 7337 | 12.4% |
| Ÿ | 4351 | 7.3% |
| ¤ | 3590 | 6.0% |
district
Text
| Distinct | 7635 |
|---|---|
| Distinct (%) | 4.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.8 MiB |
Length
| Max length | 49 |
|---|---|
| Median length | 38 |
| Mean length | 11.374505 |
| Min length | 2 |
Characters and Unicode
| Total characters | 2117057 |
|---|---|
| Distinct characters | 70 |
| Distinct categories | 9 |
| Distinct scripts | 2 |
| Distinct blocks | 3 |
Unique
| Unique | 1582 |
|---|---|
| Unique (%) | 0.8% |
Sample
| 1st row | Schüren |
|---|---|
| 2nd row | Äußere_Neustadt_Antonstadt |
| 3rd row | Neu_Schwachhausen |
| 4th row | Freiberg |
| 5th row | Denzlingen |
Words
| Value | Count | Frequency (%) |
|---|---|---|
| innenstadt | 3132 | 1.7% |
| stadtmitte | 1972 | 1.1% |
| altstadt | 1776 | 1.0% |
| sonnenberg | 1499 | 0.8% |
| kaßberg | 1317 | 0.7% |
| mitte | 1071 | 0.6% |
| schloßchemnitz | 993 | 0.5% |
| hilbersdorf | 977 | 0.5% |
| südstadt | 857 | 0.5% |
| zentrum | 797 | 0.4% |
| Other values (7625) | 171732 | 92.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 267900 | 12.7% |
| n | 155124 | 7.3% |
| r | 153922 | 7.3% |
| t | 151118 | 7.1% |
| a | 122662 | 5.8% |
| i | 101955 | 4.8% |
| d | 97043 | 4.6% |
| s | 96948 | 4.6% |
| l | 86341 | 4.1% |
| h | 81361 | 3.8% |
| Other values (60) | 802683 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1718070 | |
| Uppercase Letter | 295621 | 14.0% |
| Connector Punctuation | 61592 | 2.9% |
| Other Punctuation | 20538 | 1.0% |
| Other Number | 18435 | 0.9% |
| Currency Symbol | 2205 | 0.1% |
| Open Punctuation | 272 | < 0.1% |
| Decimal Number | 172 | < 0.1% |
| Dash Punctuation | 152 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| Ã | 42256 | |
| S | 33300 | 11.3% |
| B | 21721 | 7.3% |
| N | 18508 | 6.3% |
| H | 16830 | 5.7% |
| W | 16412 | 5.6% |
| M | 15909 | 5.4% |
| L | 13253 | 4.5% |
| G | 12381 | 4.2% |
| A | 10854 | 3.7% |
| Other values (18) | 94197 |
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 267900 | |
| n | 155124 | 9.0% |
| r | 153922 | 9.0% |
| t | 151118 | 8.8% |
| a | 122662 | 7.1% |
| i | 101955 | 5.9% |
| d | 97043 | 5.6% |
| s | 96948 | 5.6% |
| l | 86341 | 5.0% |
| h | 81361 | 4.7% |
| Other values (17) | 403696 |
Other Punctuation
| Value | Count | Frequency (%) |
| ¶ | 13493 | |
| / | 5476 | |
| . | 949 | 4.6% |
| & | 411 | 2.0% |
| , | 209 | 1.0% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 59 | |
| 5 | 42 | |
| 4 | 35 | |
| 2 | 18 | 10.5% |
| 3 | 18 | 10.5% |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 61592 |
Other Number
| Value | Count | Frequency (%) |
| ¼ | 18435 |
Currency Symbol
| Value | Count | Frequency (%) |
| ¤ | 2205 |
Open Punctuation
| Value | Count | Frequency (%) |
| „ | 272 |
Dash Punctuation
| Value | Count | Frequency (%) |
| – | 152 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2013691 | |
| Common | 103366 | 4.9% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 267900 | 13.3% |
| n | 155124 | 7.7% |
| r | 153922 | 7.6% |
| t | 151118 | 7.5% |
| a | 122662 | 6.1% |
| i | 101955 | 5.1% |
| d | 97043 | 4.8% |
| s | 96948 | 4.8% |
| l | 86341 | 4.3% |
| h | 81361 | 4.0% |
| Other values (45) | 699317 |
Common
| Value | Count | Frequency (%) |
| _ | 61592 | |
| ¼ | 18435 | 17.8% |
| ¶ | 13493 | 13.1% |
| / | 5476 | 5.3% |
| ¤ | 2205 | 2.1% |
| . | 949 | 0.9% |
| & | 411 | 0.4% |
| „ | 272 | 0.3% |
| , | 209 | 0.2% |
| – | 152 | 0.1% |
| Other values (5) | 172 | 0.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2032545 | |
| None | 84088 | 4.0% |
| Punctuation | 424 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 267900 | 13.2% |
| n | 155124 | 7.6% |
| r | 153922 | 7.6% |
| t | 151118 | 7.4% |
| a | 122662 | 6.0% |
| i | 101955 | 5.0% |
| d | 97043 | 4.8% |
| s | 96948 | 4.8% |
| l | 86341 | 4.2% |
| h | 81361 | 4.0% |
| Other values (52) | 718171 |
None
| Value | Count | Frequency (%) |
|---|---|---|
| Ã | 42256 | 50.3% |
| ¼ | 18435 | 21.9% |
| ¶ | 13493 | 16.0% |
| Ÿ | 7447 | 8.9% |
| ¤ | 2205 | 2.6% |
| œ | 252 | 0.3% |
Punctuation
| Value | Count | Frequency (%) |
| „ | 272 | |
| – | 152 |
street
Text
| Distinct | 42737 |
|---|---|
| Distinct (%) | 23.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.8 MiB |
Length
| Max length | 89 |
|---|---|
| Median length | 54 |
| Mean length | 16.861871 |
| Min length | 1 |
Characters and Unicode
| Total characters | 3138382 |
|---|---|
| Distinct characters | 85 |
| Distinct categories | 15 |
| Distinct scripts | 2 |
| Distinct blocks | 4 |
Unique
| Unique | 24774 |
|---|---|
| Unique (%) | 13.3% |
Sample
| 1st row | Schüruferstraße |
|---|---|
| 2nd row | Turnerweg |
| 3rd row | Hermann-Henrich-Meier-Allee |
| 4th row | Am Bahnhof |
| 5th row | no_information |
| Value | Count | Frequency (%) |
| no_information | 37835 | 16.0% |
| straße | 16053 | 6.8% |
| str | 10541 | 4.4% |
| am | 4788 | 2.0% |
| der | 1952 | 0.8% |
| weg | 1810 | 0.8% |
| an | 1177 | 0.5% |
| im | 925 | 0.4% |
| strasse | 804 | 0.3% |
| hauptstraße | 760 | 0.3% |
| Other values (37522) | 160232 |
Most occurring characters
| Value | Count | Frequency (%) |
| r | 290379 | 9.3% |
| e | 280415 | 8.9% |
| i | 219963 | 7.0% |
| n | 215897 | 6.9% |
| t | 212531 | 6.8% |
| s | 197042 | 6.3% |
| a | 194188 | 6.2% |
| o | 164524 | 5.2% |
| l | 156832 | 5.0% |
| g | 112664 | 3.6% |
| Other values (75) | 1093947 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 2546427 | |
| Uppercase Letter | 239427 | 7.6% |
| Other Punctuation | 216656 | 6.9% |
| Space Separator | 50826 | 1.6% |
| Dash Punctuation | 45873 | 1.5% |
| Connector Punctuation | 37863 | 1.2% |
| Decimal Number | 1220 | < 0.1% |
| Close Punctuation | 41 | < 0.1% |
| Open Punctuation | 41 | < 0.1% |
| Math Symbol | 2 | < 0.1% |
| Other values (5) | 6 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| r | 290379 | |
| e | 280415 | |
| i | 219963 | |
| n | 215897 | |
| t | 212531 | |
| s | 197042 | 7.7% |
| a | 194188 | 7.6% |
| o | 164524 | 6.5% |
| l | 156832 | 6.2% |
| g | 112664 | 4.4% |
| Other values (18) | 501992 |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 64317 | |
| B | 16623 | 6.9% |
| H | 16393 | 6.8% |
| A | 15868 | 6.6% |
| W | 12744 | 5.3% |
| K | 12639 | 5.3% |
| L | 10942 | 4.6% |
| M | 10572 | 4.4% |
| R | 10545 | 4.4% |
| G | 10166 | 4.2% |
| Other values (17) | 58618 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 274 | |
| 5 | 161 | |
| 4 | 147 | |
| 2 | 128 | |
| 3 | 122 | |
| 9 | 84 | 6.9% |
| 6 | 82 | 6.7% |
| 8 | 81 | 6.6% |
| 7 | 79 | 6.5% |
| 0 | 62 | 5.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| ; | 87572 | |
| & | 87569 | |
| . | 41267 | |
| , | 125 | 0.1% |
| / | 115 | 0.1% |
| : | 3 | < 0.1% |
| ¿ | 2 | < 0.1% |
| * | 2 | < 0.1% |
| ? | 1 | < 0.1% |
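The near-identical counts for ";" (87572) and "&" (87569) are what one would expect if non-ASCII letters in street were stored as HTML character references such as `&szlig;`. The report does not confirm this, so treat it as a hypothesis. If it holds, the standard-library `html.unescape` would restore the letters; the raw value below is a hypothetical example, not taken from the dataset.

```python
import html

# Hypothetical raw value, assuming umlauts are stored as HTML character references.
raw = "Sch&uuml;ruferstra&szlig;e"

clean = html.unescape(raw)
print(clean)
```

Strings without character references pass through `html.unescape` unchanged, so the call is safe to apply to the whole column.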
Space Separator
| Value | Count | Frequency (%) |
| 50826 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 45873 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 37863 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 41 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 41 |
Math Symbol
| Value | Count | Frequency (%) |
| + | 2 |
Other Number
| Value | Count | Frequency (%) |
| ½ | 2 |
Currency Symbol
| Value | Count | Frequency (%) |
| € | 1 |
Modifier Symbol
| Value | Count | Frequency (%) |
| ^ | 1 |
Initial Punctuation
| Value | Count | Frequency (%) |
| ‘ | 1 |
Other Letter
| Value | Count | Frequency (%) |
| ª | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2785855 | |
| Common | 352527 | 11.2% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| r | 290379 | 10.4% |
| e | 280415 | 10.1% |
| i | 219963 | 7.9% |
| n | 215897 | 7.7% |
| t | 212531 | 7.6% |
| s | 197042 | 7.1% |
| a | 194188 | 7.0% |
| o | 164524 | 5.9% |
| l | 156832 | 5.6% |
| g | 112664 | 4.0% |
| Other values (46) | 741420 |
Common
| Value | Count | Frequency (%) |
| ; | 87572 | |
| & | 87569 | |
| 50826 | ||
| - | 45873 | |
| . | 41267 | |
| _ | 37863 | |
| 1 | 274 | 0.1% |
| 5 | 161 | < 0.1% |
| 4 | 147 | < 0.1% |
| 2 | 128 | < 0.1% |
| Other values (19) | 847 | 0.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3138371 | |
| None | 9 | < 0.1% |
| Currency Symbols | 1 | < 0.1% |
| Punctuation | 1 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| r | 290379 | 9.3% |
| e | 280415 | 8.9% |
| i | 219963 | 7.0% |
| n | 215897 | 6.9% |
| t | 212531 | 6.8% |
| s | 197042 | 6.3% |
| a | 194188 | 6.2% |
| o | 164524 | 5.2% |
| l | 156832 | 5.0% |
| g | 112664 | 3.6% |
| Other values (67) | 1093936 |
None
| Value | Count | Frequency (%) |
| ¿ | 2 | |
| ï | 2 | |
| ½ | 2 | |
| Ã… | 1 | |
| â | 1 | |
| ª | 1 |
Currency Symbols
| Value | Count | Frequency (%) |
| € | 1 |
Punctuation
| Value | Count | Frequency (%) |
| ‘ | 1 |
zip_code
Real number (ℝ)
| Distinct | 6884 |
|---|---|
| Distinct (%) | 3.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 35642.157 |
| Minimum | 852 |
|---|---|
| Maximum | 99994 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.8 MiB |
Quantile statistics
| Minimum | 852 |
|---|---|
| 5-th percentile | 2625 |
| Q1 | 9113 |
| median | 35796 |
| Q3 | 53844 |
| 95-th percentile | 87700 |
| Maximum | 99994 |
| Range | 99142 |
| Interquartile range (IQR) | 44731 |
Descriptive statistics
| Standard deviation | 27816.579 |
|---|---|
| Coefficient of variation (CV) | 0.7804404 |
| Kurtosis | -0.81335796 |
| Mean | 35642.157 |
| Median Absolute Deviation (MAD) | 25552 |
| Skewness | 0.50985073 |
| Sum | 6.6338252 × 10^9 |
| Variance | 7.7376208 × 10^8 |
| Monotonicity | Not monotonic |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| 9130 | 1558 | 0.8% |
| 9126 | 1518 | 0.8% |
| 9112 | 1244 | 0.7% |
| 9131 | 1209 | 0.6% |
| 9113 | 1107 | 0.6% |
| 8056 | 836 | 0.4% |
| 39112 | 774 | 0.4% |
| 6217 | 764 | 0.4% |
| 4157 | 761 | 0.4% |
| 39108 | 720 | 0.4% |
| Other values (6874) | 175632 | 94.4% |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 852 | 1 | < 0.1% |
| 853 | 1 | < 0.1% |
| 1057 | 3 | < 0.1% |
| 1067 | 566 | 0.3% |
| 1069 | 158 | 0.1% |
| 1097 | 310 | 0.2% |
| 1099 | 368 | 0.2% |
| 1108 | 27 | < 0.1% |
| 1109 | 64 | < 0.1% |
| 1127 | 152 | 0.1% |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 99994 | 6 | < 0.1% |
| 99991 | 5 | < 0.1% |
| 99976 | 4 | < 0.1% |
| 99974 | 120 | 0.1% |
| 99958 | 1 | < 0.1% |
| 99955 | 3 | < 0.1% |
| 99947 | 68 | < 0.1% |
| 99898 | 1 | < 0.1% |
| 99897 | 1 | < 0.1% |
| 99894 | 4 | < 0.1% |
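Because zip_code was profiled as a real number, values such as 1067 and 9130 have lost their leading zero; German postal codes are five digits long. A minimal sketch of restoring the string form follows; `format_zip` is a hypothetical helper, and it assumes every value is meant to be a five-digit code.

```python
def format_zip(code: int) -> str:
    """Render a numeric postal code as a zero-padded five-digit string."""
    return str(code).zfill(5)


for code in (1067, 9130, 39112):
    print(code, "->", format_zip(code))
```

Treating the column as a string also makes the mean, variance, and skewness reported above unnecessary, since they carry no meaning for an identifier.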
has_kitchen
Boolean
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.6 MiB |
| Value | Count | Frequency (%) |
|---|---|---|
| False | 122177 | 65.6% |
| True | 63946 | 34.4% |
balcony
Boolean
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.6 MiB |
| Value | Count | Frequency (%) |
|---|---|---|
| True | 117961 | 63.4% |
| False | 68162 | 36.6% |
lift
Boolean
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.6 MiB |
| Value | Count | Frequency (%) |
|---|---|---|
| False | 140148 | 75.3% |
| True | 45975 | 24.7% |
garden
Boolean
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.6 MiB |
| Value | Count | Frequency (%) |
|---|---|---|
| False | 149127 | 80.1% |
| True | 36996 | 19.9% |
floor
Real number (ℝ)
ZEROS 
| Distinct | 34 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.1089065 |
| Minimum | -1 |
|---|---|
| Maximum | 45 |
| Zeros | 20974 |
| Zeros (%) | 11.3% |
| Negative | 259 |
| Negative (%) | 0.1% |
| Memory size | 2.8 MiB |
Quantile statistics
| Minimum | -1 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 1 |
| median | 2 |
| Q3 | 3 |
| 95-th percentile | 5 |
| Maximum | 45 |
| Range | 46 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 1.669126 |
|---|---|
| Coefficient of variation (CV) | 0.79146516 |
| Kurtosis | 18.089261 |
| Mean | 2.1089065 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 2.4111115 |
| Sum | 392516 |
| Variance | 2.7859816 |
| Monotonicity | Not monotonic |
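Several of the descriptive statistics for floor are simple functions of the others, which gives a quick consistency check. The sketch below recomputes them from the reported figures; it assumes the usual definitions (mean = sum / n, variance = standard deviation squared, CV = standard deviation / mean) and is not taken from the profiling tool's source.

```python
# Figures copied from the floor statistics above.
n = 186_123          # number of observations (no missing values)
total = 392_516      # Sum
std = 1.669126       # Standard deviation
q1, q3 = 1, 3        # Q1 and Q3
minimum, maximum = -1, 45

mean = total / n
variance = std ** 2
cv = std / mean              # coefficient of variation
iqr = q3 - q1                # interquartile range
value_range = maximum - minimum

print(f"mean={mean:.7f} variance={variance:.5f} cv={cv:.5f}")
print(f"iqr={iqr} range={value_range}")
```

The recomputed values agree with the table to the precision shown there.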
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 54343 | 29.2% |
| 2 | 48763 | 26.2% |
| 3 | 32591 | 17.5% |
| 0 | 20974 | 11.3% |
| 4 | 17395 | 9.3% |
| 5 | 6917 | 3.7% |
| 6 | 2079 | 1.1% |
| 7 | 894 | 0.5% |
| 8 | 502 | 0.3% |
| 9 | 371 | 0.2% |
| Other values (24) | 1294 | 0.7% |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| -1 | 259 | 0.1% |
| 0 | 20974 | 11.3% |
| 1 | 54343 | 29.2% |
| 2 | 48763 | 26.2% |
| 3 | 32591 | 17.5% |
| 4 | 17395 | 9.3% |
| 5 | 6917 | 3.7% |
| 6 | 2079 | 1.1% |
| 7 | 894 | 0.5% |
| 8 | 502 | 0.3% |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 45 | 1 | < 0.1% |
| 41 | 1 | < 0.1% |
| 36 | 1 | < 0.1% |
| 32 | 1 | < 0.1% |
| 31 | 1 | < 0.1% |
| 29 | 2 | < 0.1% |
| 26 | 2 | < 0.1% |
| 25 | 1 | < 0.1% |
| 24 | 3 | < 0.1% |
| 23 | 1 | < 0.1% |
flat_type
Categorical
MISSING 
| Distinct | 10 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 25049 |
| Missing (%) | 13.5% |
| Memory size | 1.6 MiB |
Length
| Max length | 19 |
|---|---|
| Median length | 9 |
| Mean length | 9.7160808 |
| Min length | 4 |
Characters and Unicode
| Total characters | 1565008 |
|---|---|
| Distinct characters | 20 |
| Distinct categories | 2 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | ground_floor |
|---|---|
| 2nd row | apartment |
| 3rd row | apartment |
| 4th row | roof_storey |
| 5th row | apartment |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| apartment | 99518 | 53.5% |
| roof_storey | 23694 | 12.7% |
| ground_floor | 15942 | 8.6% |
| other | 6532 | 3.5% |
| maisonette | 6253 | 3.4% |
| raised_ground_floor | 3128 | 1.7% |
| penthouse | 2420 | 1.3% |
| terraced_flat | 2228 | 1.2% |
| half_basement | 734 | 0.4% |
| loft | 625 | 0.3% |
| (Missing) | 25049 | 13.5% |
Words
| Value | Count | Frequency (%) |
|---|---|---|
| apartment | 99518 | 61.8% |
| roof_storey | 23694 | 14.7% |
| ground_floor | 15942 | 9.9% |
| other | 6532 | 4.1% |
| maisonette | 6253 | 3.9% |
| raised_ground_floor | 3128 | 1.9% |
| penthouse | 2420 | 1.5% |
| terraced_flat | 2228 | 1.4% |
| half_basement | 734 | 0.5% |
| loft | 625 | 0.4% |
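The flat_type frequencies appear with two different bases: roof_storey is 12.7% under Common Values but 14.7% in the table that follows it. The counts are identical, so the difference appears to come from the denominator, all rows in the first case and only non-missing rows in the second. A minimal sketch of that arithmetic, with illustrative variable names:

```python
n_rows = 186_123      # number of observations
n_missing = 25_049    # missing flat_type values
n_present = n_rows - n_missing

count = 23_694        # roof_storey

share_of_all = 100 * count / n_rows
share_of_present = 100 * count / n_present

print(f"{share_of_all:.1f}% of all rows")
print(f"{share_of_present:.1f}% of non-missing rows")
```

When comparing categories across variables with different missing rates, the non-missing base is usually the fairer one to quote.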
Most occurring characters
| Value | Count | Frequency (%) |
| t | 250003 | |
| a | 214341 | |
| r | 199162 | |
| e | 156142 | |
| o | 144122 | |
| n | 127995 | |
| m | 106505 | |
| p | 101938 | |
| _ | 48854 | 3.1% |
| f | 46351 | 3.0% |
| Other values (10) | 169595 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1516154 | |
| Connector Punctuation | 48854 | 3.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 250003 | |
| a | 214341 | |
| r | 199162 | |
| e | 156142 | |
| o | 144122 | |
| n | 127995 | |
| m | 106505 | |
| p | 101938 | |
| f | 46351 | 3.1% |
| s | 36229 | 2.4% |
| Other values (9) | 133366 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 48854 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1516154 | |
| Common | 48854 | 3.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| t | 250003 | |
| a | 214341 | |
| r | 199162 | |
| e | 156142 | |
| o | 144122 | |
| n | 127995 | |
| m | 106505 | |
| p | 101938 | |
| f | 46351 | 3.1% |
| s | 36229 | 2.4% |
| Other values (9) | 133366 |
Common
| Value | Count | Frequency (%) |
| _ | 48854 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1565008 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| t | 250003 | |
| a | 214341 | |
| r | 199162 | |
| e | 156142 | |
| o | 144122 | |
| n | 127995 | |
| m | 106505 | |
| p | 101938 | |
| _ | 48854 | 3.1% |
| f | 46351 | 3.0% |
| Other values (10) | 169595 |
telekom_uploadspeed
Real number (ℝ)
MISSING 
| Distinct | 7 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 22095 |
| Missing (%) | 11.9% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 29.01464 |
| Minimum | 1 |
|---|---|
| Maximum | 100 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.8 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 2.4 |
| Q1 | 10 |
| median | 40 |
| Q3 | 40 |
| 95-th percentile | 40 |
| Maximum | 100 |
| Range | 99 |
| Interquartile range (IQR) | 30 |
Descriptive statistics
| Standard deviation | 16.263861 |
|---|---|
| Coefficient of variation (CV) | 0.56053982 |
| Kurtosis | -1.0411583 |
| Mean | 29.01464 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | -0.75085262 |
| Sum | 4759213.4 |
| Variance | 264.51318 |
| Monotonicity | Not monotonic |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| 40 | 111157 | 59.7% |
| 2.4 | 29121 | 15.6% |
| 10 | 22774 | 12.2% |
| 5 | 689 | 0.4% |
| 1 | 142 | 0.1% |
| 100 | 116 | 0.1% |
| 4 | 29 | < 0.1% |
| (Missing) | 22095 | 11.9% |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 142 | 0.1% |
| 2.4 | 29121 | 15.6% |
| 4 | 29 | < 0.1% |
| 5 | 689 | 0.4% |
| 10 | 22774 | 12.2% |
| 40 | 111157 | 59.7% |
| 100 | 116 | 0.1% |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 100 | 116 | 0.1% |
| 40 | 111157 | 59.7% |
| 10 | 22774 | 12.2% |
| 5 | 689 | 0.4% |
| 4 | 29 | < 0.1% |
| 2.4 | 29121 | 15.6% |
| 1 | 142 | 0.1% |
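With only seven distinct values, the summary statistics for telekom_uploadspeed can be rebuilt directly from the frequency table. The sketch below recomputes the non-missing count, the sum, and the mean as a weighted average; the dictionary is transcribed from the table above and the variable names are illustrative.

```python
# value -> count, transcribed from the frequency table (missing rows excluded).
value_counts = {40: 111_157, 2.4: 29_121, 10: 22_774, 5: 689, 1: 142, 100: 116, 4: 29}

n = sum(value_counts.values())
total = sum(value * count for value, count in value_counts.items())
mean = total / n

print(f"n={n} sum={total:.1f} mean={mean:.5f}")
```

The non-missing count equals 186123 minus the 22095 missing values, and the sum and mean agree with the Descriptive statistics table, so the mean is computed over non-missing rows only.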
firing_type
Categorical
HIGH CARDINALITY  IMBALANCE  MISSING 
| Distinct | 107 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 35948 |
| Missing (%) | 19.3% |
| Memory size | 1.6 MiB |
Length
| Max length | 187 |
|---|---|
| Median length | 3 |
| Mean length | 8.4514799 |
| Min length | 3 |
Characters and Unicode
| Total characters | 1269201 |
|---|---|
| Distinct characters | 25 |
| Distinct categories | 3 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 36 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | oil |
|---|---|
| 2nd row | gas |
| 3rd row | oil |
| 4th row | gas |
| 5th row | gas |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| gas | 76209 | 40.9% |
| district_heating | 38253 | 20.6% |
| oil | 11989 | 6.4% |
| natural_gas_light | 7811 | 4.2% |
| electricity | 3179 | 1.7% |
| natural_gas_heavy | 2934 | 1.6% |
| geothermal | 1662 | 0.9% |
| pellet_heating | 1660 | 0.9% |
| gas:electricity | 906 | 0.5% |
| combined_heat_and_power_fossil_fuels | 755 | 0.4% |
| Other values (97) | 4817 | 2.6% |
| (Missing) | 35948 | 19.3% |
Words
| Value | Count | Frequency (%) |
|---|---|---|
| gas | 76209 | 50.7% |
| district_heating | 38253 | 25.5% |
| oil | 11989 | 8.0% |
| natural_gas_light | 7811 | 5.2% |
| electricity | 3179 | 2.1% |
| natural_gas_heavy | 2934 | 2.0% |
| geothermal | 1662 | 1.1% |
| pellet_heating | 1660 | 1.1% |
| gas:electricity | 906 | 0.6% |
| combined_heat_and_power_fossil_fuels | 755 | 0.5% |
| Other values (97) | 4817 | 3.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 163887 | |
| t | 156553 | |
| i | 154530 | |
| g | 142185 | |
| s | 132613 | |
| _ | 72388 | 5.7% |
| e | 71154 | 5.6% |
| r | 60936 | 4.8% |
| n | 58650 | 4.6% |
| h | 57430 | 4.5% |
| Other values (15) | 198875 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1194552 | |
| Connector Punctuation | 72388 | 5.7% |
| Other Punctuation | 2261 | 0.2% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 163887 | |
| t | 156553 | |
| i | 154530 | |
| g | 142185 | |
| s | 132613 | |
| e | 71154 | |
| r | 60936 | 5.1% |
| n | 58650 | 4.9% |
| h | 57430 | 4.8% |
| c | 50734 | 4.2% |
| Other values (13) | 145880 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 72388 |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 2261 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1194552 | |
| Common | 74649 | 5.9% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 163887 | |
| t | 156553 | |
| i | 154530 | |
| g | 142185 | |
| s | 132613 | |
| e | 71154 | |
| r | 60936 | 5.1% |
| n | 58650 | 4.9% |
| h | 57430 | 4.8% |
| c | 50734 | 4.2% |
| Other values (13) | 145880 |
Common
| Value | Count | Frequency (%) |
| _ | 72388 | |
| : | 2261 | 3.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1269201 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 163887 | |
| t | 156553 | |
| i | 154530 | |
| g | 142185 | |
| s | 132613 | |
| _ | 72388 | 5.7% |
| e | 71154 | 5.6% |
| r | 60936 | 4.8% |
| n | 58650 | 4.6% |
| h | 57430 | 4.5% |
| Other values (15) | 198875 |
heating_type
Categorical
MISSING 
| Distinct | 13 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 25841 |
| Missing (%) | 13.9% |
| Memory size | 1.6 MiB |
Length
| Max length | 30 |
|---|---|
| Median length | 15 |
| Mean length | 15.802511 |
| Min length | 9 |
Characters and Unicode
| Total characters | 2532858 |
|---|---|
| Distinct characters | 21 |
| Distinct categories | 2 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | central_heating |
|---|---|
| 2nd row | floor_heating |
| 3rd row | self_contained_central_heating |
| 4th row | self_contained_central_heating |
| 5th row | oil_heating |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| central_heating | 91009 | 48.9% |
| district_heating | 19280 | 10.4% |
| gas_heating | 15171 | 8.2% |
| self_contained_central_heating | 12671 | 6.8% |
| floor_heating | 12513 | 6.7% |
| oil_heating | 3619 | 1.9% |
| heat_pump | 1760 | 0.9% |
| combined_heat_and_power_plant | 1580 | 0.8% |
| night_storage_heater | 1018 | 0.5% |
| wood_pellet_heating | 717 | 0.4% |
| Other values (3) | 944 | 0.5% |
| (Missing) | 25841 | 13.9% |
Words
| Value | Count | Frequency (%) |
|---|---|---|
| central_heating | 91009 | 56.8% |
| district_heating | 19280 | 12.0% |
| gas_heating | 15171 | 9.5% |
| self_contained_central_heating | 12671 | 7.9% |
| floor_heating | 12513 | 7.8% |
| oil_heating | 3619 | 2.3% |
| heat_pump | 1760 | 1.1% |
| combined_heat_and_power_plant | 1580 | 1.0% |
| night_storage_heater | 1018 | 0.6% |
| wood_pellet_heating | 717 | 0.4% |
| Other values (3) | 944 | 0.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| t | 320339 | |
| e | 297360 | |
| a | 296113 | |
| n | 290704 | |
| i | 213985 | |
| _ | 192099 | |
| g | 173131 | |
| h | 161300 | |
| r | 139833 | |
| c | 138437 | |
| Other values (11) | 309557 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 2340759 | |
| Connector Punctuation | 192099 | 7.6% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 320339 | |
| e | 297360 | |
| a | 296113 | |
| n | 290704 | |
| i | 213985 | |
| g | 173131 | |
| h | 161300 | |
| r | 139833 | |
| c | 138437 | |
| l | 136241 | |
| Other values (10) | 173316 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 192099 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2340759 | |
| Common | 192099 | 7.6% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| t | 320339 | |
| e | 297360 | |
| a | 296113 | |
| n | 290704 | |
| i | 213985 | |
| g | 173131 | |
| h | 161300 | |
| r | 139833 | |
| c | 138437 | |
| l | 136241 | |
| Other values (10) | 173316 |
Common
| Value | Count | Frequency (%) |
| _ | 192099 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2532858 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| t | 320339 | 12.6% |
| e | 297360 | 11.7% |
| a | 296113 | 11.7% |
| n | 290704 | 11.5% |
| i | 213985 | 8.4% |
| _ | 192099 | 7.6% |
| g | 173131 | 6.8% |
| h | 161300 | 6.4% |
| r | 139833 | 5.5% |
| c | 138437 | 5.5% |
| Other values (11) | 309557 | 12.2% |
number_of_rooms
Real number (ℝ)
| Distinct | 15 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.6626693 |
| Minimum | 1 |
|---|---|
| Maximum | 16 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.1 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 2 |
| median | 3 |
| Q3 | 3 |
| 95-th percentile | 4 |
| Maximum | 16 |
| Range | 15 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 0.98469852 |
|---|---|
| Coefficient of variation (CV) | 0.3698163 |
| Kurtosis | 1.7996565 |
| Mean | 2.6626693 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 0.61704796 |
| Sum | 495584 |
| Variance | 0.96963117 |
| Monotonicity | Not monotonic |
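Several of the number_of_rooms statistics above are derived from others in the same tables. The following sketch, using only values reported above, shows the standard relationships (coefficient of variation = standard deviation / mean, variance = standard deviation squared, range = maximum - minimum, IQR = Q3 - Q1, sum = mean × number of observations). Variable names are illustrative, and small differences are expected because the report rounds its inputs.

```python
# Values copied from the number_of_rooms tables and the dataset overview.
n_observations = 186123
mean = 2.6626693
std = 0.98469852
minimum, maximum = 1, 16
q1, q3 = 2, 3

cv = std / mean                    # coefficient of variation
variance = std ** 2                # variance from the standard deviation
value_range = maximum - minimum    # range
iqr = q3 - q1                      # interquartile range
total = mean * n_observations      # sum of all values

print(f"CV       = {cv:.7f}")
print(f"Variance = {variance:.8f}")
print(f"Range    = {value_range}")
print(f"IQR      = {iqr}")
print(f"Sum      = {total:.0f}")
```

The same checks apply to the other numeric variables in this report.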
| Value | Count | Frequency (%) |
| 3 | 70795 | 38.0% |
| 2 | 65207 | 35.0% |
| 4 | 25144 | 13.5% |
| 1 | 18644 | 10.0% |
| 5 | 4974 | 2.7% |
| 6 | 1018 | 0.5% |
| 7 | 225 | 0.1% |
| 8 | 79 | < 0.1% |
| 9 | 20 | < 0.1% |
| 10 | 6 | < 0.1% |
| Other values (5) | 11 | < 0.1% |
| Value | Count | Frequency (%) |
| 1 | 18644 | 10.0% |
| 2 | 65207 | 35.0% |
| 3 | 70795 | 38.0% |
| 4 | 25144 | 13.5% |
| 5 | 4974 | 2.7% |
| 6 | 1018 | 0.5% |
| 7 | 225 | 0.1% |
| 8 | 79 | < 0.1% |
| 9 | 20 | < 0.1% |
| 10 | 6 | < 0.1% |
| Value | Count | Frequency (%) |
| 16 | 1 | < 0.1% |
| 15 | 2 | < 0.1% |
| 13 | 2 | < 0.1% |
| 12 | 2 | < 0.1% |
| 11 | 4 | < 0.1% |
| 10 | 6 | < 0.1% |
| 9 | 20 | < 0.1% |
| 8 | 79 | < 0.1% |
| 7 | 225 | 0.1% |
| 6 | 1018 | 0.5% |
square_meter
Real number (ℝ)
| Distinct | 329 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 72.505037 |
| Minimum | 0 |
|---|---|
| Maximum | 649 |
| Zeros | 35 |
| Zeros (%) | < 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.1 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 33 |
| Q1 | 54 |
| median | 67 |
| Q3 | 85 |
| 95-th percentile | 129 |
| Maximum | 649 |
| Range | 649 |
| Interquartile range (IQR) | 31 |
Descriptive statistics
| Standard deviation | 30.445359 |
|---|---|
| Coefficient of variation (CV) | 0.41990682 |
| Kurtosis | 7.8370541 |
| Mean | 72.505037 |
| Median Absolute Deviation (MAD) | 15 |
| Skewness | 1.7672785 |
| Sum | 13494855 |
| Variance | 926.9199 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 60 | 5925 | 3.2% |
| 70 | 4498 | 2.4% |
| 65 | 4240 | 2.3% |
| 58 | 3882 | 2.1% |
| 57 | 3786 | 2.0% |
| 50 | 3778 | 2.0% |
| 61 | 3747 | 2.0% |
| 55 | 3612 | 1.9% |
| 59 | 3573 | 1.9% |
| 80 | 3525 | 1.9% |
| Other values (319) | 145557 | 78.2% |
| Value | Count | Frequency (%) |
| 0 | 35 | < 0.1% |
| 3 | 1 | < 0.1% |
| 6 | 1 | < 0.1% |
| 8 | 5 | < 0.1% |
| 9 | 10 | < 0.1% |
| 10 | 35 | < 0.1% |
| 11 | 35 | < 0.1% |
| 12 | 83 | < 0.1% |
| 13 | 54 | < 0.1% |
| 14 | 64 | < 0.1% |
| Value | Count | Frequency (%) |
| 649 | 1 | < 0.1% |
| 527 | 1 | < 0.1% |
| 482 | 1 | < 0.1% |
| 480 | 2 | < 0.1% |
| 446 | 1 | < 0.1% |
| 430 | 1 | < 0.1% |
| 423 | 1 | < 0.1% |
| 420 | 2 | < 0.1% |
| 413 | 1 | < 0.1% |
| 400 | 2 | < 0.1% |
base_rent
Real number (ℝ)
| Distinct | 2646 |
|---|---|
| Distinct (%) | 1.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 634.75968 |
| Minimum | 0 |
|---|---|
| Maximum | 8700 |
| Zeros | 17 |
| Zeros (%) | < 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.8 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 229 |
| Q1 | 330 |
| median | 480 |
| Q3 | 785 |
| 95-th percentile | 1500 |
| Maximum | 8700 |
| Range | 8700 |
| Interquartile range (IQR) | 455 |
Descriptive statistics
| Standard deviation | 479.01583 |
|---|---|
| Coefficient of variation (CV) | 0.75464124 |
| Kurtosis | 18.999186 |
| Mean | 634.75968 |
| Median Absolute Deviation (MAD) | 181 |
| Skewness | 3.1503837 |
| Sum | 1.1814338 × 10⁸ |
| Variance | 229456.17 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 350 | 2620 | 1.4% |
| 450 | 2362 | 1.3% |
| 300 | 2290 | 1.2% |
| 400 | 2121 | 1.1% |
| 650 | 1887 | 1.0% |
| 550 | 1820 | 1.0% |
| 320 | 1777 | 1.0% |
| 500 | 1684 | 0.9% |
| 750 | 1667 | 0.9% |
| 330 | 1652 | 0.9% |
| Other values (2636) | 166243 | 89.3% |
| Value | Count | Frequency (%) |
| 0 | 17 | < 0.1% |
| 1 | 1 | < 0.1% |
| 3 | 4 | < 0.1% |
| 4 | 2 | < 0.1% |
| 5 | 2 | < 0.1% |
| 6 | 3 | < 0.1% |
| 10 | 1 | < 0.1% |
| 11 | 2 | < 0.1% |
| 42 | 1 | < 0.1% |
| 50 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 8700 | 1 | < 0.1% |
| 8500 | 1 | < 0.1% |
| 8400 | 1 | < 0.1% |
| 7850 | 1 | < 0.1% |
| 7830 | 1 | < 0.1% |
| 7600 | 1 | < 0.1% |
| 7200 | 1 | < 0.1% |
| 7020 | 1 | < 0.1% |
| 7000 | 2 | < 0.1% |
| 6990 | 2 | < 0.1% |
total_rent
Real number (ℝ)
| Distinct | 3213 |
|---|---|
| Distinct (%) | 1.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 805.60146 |
| Minimum | 0 |
|---|---|
| Maximum | 9000 |
| Zeros | 214 |
| Zeros (%) | 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.8 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 325 |
| Q1 | 465 |
| median | 640 |
| Q3 | 975 |
| 95-th percentile | 1785 |
| Maximum | 9000 |
| Range | 9000 |
| Interquartile range (IQR) | 510 |
Descriptive statistics
| Standard deviation | 538.97373 |
|---|---|
| Coefficient of variation (CV) | 0.66903272 |
| Kurtosis | 18.082371 |
| Mean | 805.60146 |
| Median Absolute Deviation (MAD) | 212 |
| Skewness | 3.0563415 |
| Sum | 1.4994096 × 10⁸ |
| Variance | 290492.69 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 450 | 1556 | 0.8% |
| 500 | 1536 | 0.8% |
| 600 | 1427 | 0.8% |
| 550 | 1340 | 0.7% |
| 400 | 1244 | 0.7% |
| 490 | 1220 | 0.7% |
| 750 | 1180 | 0.6% |
| 480 | 1175 | 0.6% |
| 470 | 1166 | 0.6% |
| 420 | 1164 | 0.6% |
| Other values (3203) | 173115 | 93.0% |
| Value | Count | Frequency (%) |
| 0 | 214 | 0.1% |
| 1 | 12 | < 0.1% |
| 2 | 3 | < 0.1% |
| 3 | 1 | < 0.1% |
| 4 | 5 | < 0.1% |
| 6 | 2 | < 0.1% |
| 8 | 1 | < 0.1% |
| 9 | 1 | < 0.1% |
| 20 | 1 | < 0.1% |
| 50 | 2 | < 0.1% |
| Value | Count | Frequency (%) |
| 9000 | 1 | < 0.1% |
| 8780 | 1 | < 0.1% |
| 8706 | 1 | < 0.1% |
| 8645 | 1 | < 0.1% |
| 8550 | 1 | < 0.1% |
| 8500 | 1 | < 0.1% |
| 8430 | 1 | < 0.1% |
| 8200 | 1 | < 0.1% |
| 7850 | 1 | < 0.1% |
| 7845 | 1 | < 0.1% |
service_charge
Real number (ℝ)
ZEROS 
| Distinct | 732 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 149.61397 |
| Minimum | 0 |
|---|---|
| Maximum | 6045 |
| Zeros | 2265 |
| Zeros (%) | 1.2% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.8 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 50 |
| Q1 | 95 |
| median | 135 |
| Q3 | 187 |
| 95-th percentile | 300 |
| Maximum | 6045 |
| Range | 6045 |
| Interquartile range (IQR) | 92 |
Descriptive statistics
| Standard deviation | 86.084007 |
|---|---|
| Coefficient of variation (CV) | 0.57537412 |
| Kurtosis | 136.89001 |
| Mean | 149.61397 |
| Median Absolute Deviation (MAD) | 45 |
| Skewness | 4.1294227 |
| Sum | 27846601 |
| Variance | 7410.4562 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 150 | 9969 | 5.4% |
| 100 | 9034 | 4.9% |
| 200 | 7663 | 4.1% |
| 120 | 7574 | 4.1% |
| 130 | 5415 | 2.9% |
| 140 | 5148 | 2.8% |
| 80 | 4902 | 2.6% |
| 180 | 4676 | 2.5% |
| 90 | 4469 | 2.4% |
| 110 | 4450 | 2.4% |
| Other values (722) | 122823 | 66.0% |
| Value | Count | Frequency (%) |
| 0 | 2265 | 1.2% |
| 1 | 18 | < 0.1% |
| 2 | 14 | < 0.1% |
| 5 | 1 | < 0.1% |
| 9 | 1 | < 0.1% |
| 10 | 6 | < 0.1% |
| 11 | 1 | < 0.1% |
| 12 | 1 | < 0.1% |
| 13 | 2 | < 0.1% |
| 15 | 14 | < 0.1% |
| Value | Count | Frequency (%) |
| 6045 | 1 | < 0.1% |
| 2150 | 1 | < 0.1% |
| 1977 | 1 | < 0.1% |
| 1920 | 1 | < 0.1% |
| 1837 | 1 | < 0.1% |
| 1800 | 1 | < 0.1% |
| 1740 | 1 | < 0.1% |
| 1700 | 1 | < 0.1% |
| 1580 | 1 | < 0.1% |
| 1540 | 1 | < 0.1% |
immoscout_id
Real number (ℝ)
UNIQUE 
| Distinct | 186123 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.065018 × 10⁸ |
| Minimum | 28871743 |
|---|---|
| Maximum | 1.1571166 × 10⁸ |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.8 MiB |
Quantile statistics
| Minimum | 28871743 |
|---|---|
| 5-th percentile | 76003942 |
| Q1 | 1.0645841 × 10⁸ |
| median | 1.1106221 × 10⁸ |
| Q3 | 1.1374517 × 10⁸ |
| 95-th percentile | 1.155979 × 10⁸ |
| Maximum | 1.1571166 × 10⁸ |
| Range | 86839917 |
| Interquartile range (IQR) | 7286763.5 |
Descriptive statistics
| Standard deviation | 13003773 |
|---|---|
| Coefficient of variation (CV) | 0.12209909 |
| Kurtosis | 7.3399625 |
| Mean | 1.065018 × 10⁸ |
| Median Absolute Deviation (MAD) | 3686983 |
| Skewness | -2.6246264 |
| Sum | 1.9822434 × 10¹³ |
| Variance | 1.6909811 × 10¹⁴ |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 96107057 | 1 | < 0.1% |
| 106362091 | 1 | < 0.1% |
| 112873334 | 1 | < 0.1% |
| 113698989 | 1 | < 0.1% |
| 110476726 | 1 | < 0.1% |
| 111619968 | 1 | < 0.1% |
| 106844375 | 1 | < 0.1% |
| 111072780 | 1 | < 0.1% |
| 111418434 | 1 | < 0.1% |
| 107744065 | 1 | < 0.1% |
| Other values (186113) | 186113 | > 99.9% |
| Value | Count | Frequency (%) |
| 28871743 | 1 | < 0.1% |
| 29301391 | 1 | < 0.1% |
| 29301750 | 1 | < 0.1% |
| 29370795 | 1 | < 0.1% |
| 29506747 | 1 | < 0.1% |
| 29707618 | 1 | < 0.1% |
| 29718404 | 1 | < 0.1% |
| 29718866 | 1 | < 0.1% |
| 29718867 | 1 | < 0.1% |
| 29718995 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 115711660 | 1 | < 0.1% |
| 115711556 | 1 | < 0.1% |
| 115711546 | 1 | < 0.1% |
| 115711544 | 1 | < 0.1% |
| 115711540 | 1 | < 0.1% |
| 115711539 | 1 | < 0.1% |
| 115711538 | 1 | < 0.1% |
| 115711536 | 1 | < 0.1% |
| 115711535 | 1 | < 0.1% |
| 115711534 | 1 | < 0.1% |
| | bundesland | city | district | street | zip_code | has_kitchen | balcony | lift | garden | floor | flat_type | telekom_uploadspeed | firing_type | heating_type | number_of_rooms | square_meter | base_rent | total_rent | service_charge | immoscout_id |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | Nordrhein_Westfalen | Dortmund | Schüren | Schüruferstraße | 44269 | False | False | False | True | 1.0 | ground_floor | 10.0 | oil | central_heating | 4 | 86 | 595 | 840 | 245.0 | 96107057 |
| 1 | Sachsen | Dresden | Äußere_Neustadt_Antonstadt | Turnerweg | 1097 | False | True | True | False | 3.0 | apartment | 2.4 | NaN | floor_heating | 3 | 84 | 965 | 1300 | 255.0 | 113147523 |
| 2 | Bremen | Bremen | Neu_Schwachhausen | Hermann-Henrich-Meier-Allee | 28213 | False | True | False | False | 1.0 | apartment | NaN | gas | self_contained_central_heating | 3 | 85 | 765 | 903 | 138.0 | 114751222 |
| 3 | Sachsen | Mittelsachsen_Kreis | Freiberg | Am Bahnhof | 9599 | False | False | False | True | 1.0 | NaN | 2.4 | NaN | self_contained_central_heating | 2 | 62 | 310 | 380 | 70.0 | 114391930 |
| 5 | Baden_Württemberg | Emmendingen_Kreis | Denzlingen | no_information | 79211 | True | False | False | False | 2.0 | roof_storey | 40.0 | oil | oil_heating | 2 | 53 | 580 | 690 | 110.0 | 106416361 |
| 6 | Sachsen | Chemnitz | Sonnenberg | Hofer Straße | 9130 | False | True | False | False | 3.0 | apartment | 40.0 | gas | NaN | 2 | 40 | 219 | 307 | 88.0 | 112923517 |
| 7 | Sachsen | Mittelsachsen_Kreis | Frankenberg/Sachsen | no_information | 9669 | False | False | False | True | 1.0 | NaN | 2.4 | gas | central_heating | 3 | 80 | 400 | 555 | 155.0 | 109842225 |
| 9 | Nordrhein_Westfalen | Hamm | Mitte | no_information | 59065 | False | False | False | False | 4.0 | apartment | 40.0 | oil | central_heating | 4 | 123 | 950 | 1150 | 200.0 | 101730329 |
| 10 | Nordrhein_Westfalen | Dortmund | Kirchhörde | Am Dimberg | 44229 | False | True | True | False | 0.0 | ground_floor | 2.4 | gas | gas_heating | 3 | 87 | 973 | 1321 | 215.0 | 92798563 |
| 11 | Thüringen | Weimar | Schöndorf | Birkenhof | 99427 | False | True | False | False | 4.0 | apartment | 2.4 | district_heating | district_heating | 1 | 37 | 220 | 300 | 80.0 | 106896167 |
| | bundesland | city | district | street | zip_code | has_kitchen | balcony | lift | garden | floor | flat_type | telekom_uploadspeed | firing_type | heating_type | number_of_rooms | square_meter | base_rent | total_rent | service_charge | immoscout_id |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 228321 | Nordrhein_Westfalen | Neuss_Rhein_Kreis | Neuss | Further Str. | 41462 | False | False | False | False | 4.0 | apartment | 40.0 | natural_gas_light | self_contained_central_heating | 3 | 98 | 740 | 920 | 180.0 | 113557363 |
| 228322 | Sachsen_Anhalt | Magdeburg | Hopfengarten | Gustav-Ricker-Str. | 39120 | False | True | False | False | 1.0 | other | NaN | heat_supply | central_heating | 2 | 55 | 380 | 515 | 135.0 | 113358959 |
| 228323 | Bayern | München | Maxvorstadt | no_information | 80799 | True | True | False | False | 0.0 | ground_floor | 10.0 | NaN | district_heating | 2 | 65 | 1780 | 1980 | 100.0 | 104772937 |
| 228324 | Hessen | Frankfurt_am_Main | Preungesheim | Gundelandst. | 60435 | True | True | True | True | 2.0 | apartment | 2.4 | district_heating | district_heating | 3 | 90 | 1255 | 1480 | 112.0 | 106995489 |
| 228325 | Sachsen_Anhalt | Magdeburg | Cracau | Thomas-Mann-Str. | 39114 | False | True | False | False | 2.0 | apartment | 40.0 | gas:electricity | central_heating | 3 | 57 | 303 | 425 | 98.0 | 110721511 |
| 228326 | Sachsen | Zwickau | Nordvorstadt | Mühlpfortstraße | 8058 | True | False | False | False | 3.0 | maisonette | 40.0 | NaN | NaN | 2 | 60 | 300 | 440 | 140.0 | 111857041 |
| 228327 | Sachsen | Chemnitz | Kappel | Neefestraße | 9119 | False | True | False | True | 1.0 | apartment | 40.0 | gas | central_heating | 2 | 55 | 248 | 368 | 120.0 | 91110231 |
| 228328 | Nordrhein_Westfalen | Essen | Horst | no_information | 45279 | False | False | False | False | 3.0 | roof_storey | 2.4 | gas | gas_heating | 3 | 85 | 590 | 670 | 80.0 | 115526313 |
| 228330 | Hessen | Bergstraße_Kreis | Viernheim | no_information | 68519 | True | True | False | False | 1.0 | apartment | NaN | gas | gas_heating | 4 | 115 | 930 | 1150 | 220.0 | 96981497 |
| 228331 | Hessen | Limburg_Weilburg_Kreis | Limburg_an_der_Lahn | Emsbachstrasse | 65552 | False | True | False | True | 1.0 | apartment | 40.0 | gas | central_heating | 4 | 95 | 650 | 930 | 220.0 | 66924271 |
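A quick sanity check on the rent columns can be sketched from the first ten sample rows shown above. It tests the hypothesis that total_rent equals base_rent plus service_charge. That relationship is an assumption, not something the report states, and a listing's total may include other costs such as heating. The tuple layout and variable names are illustrative.

```python
# (row index, base_rent, total_rent, service_charge) from the first ten sample rows.
rows = [
    (0, 595, 840, 245.0),
    (1, 965, 1300, 255.0),
    (2, 765, 903, 138.0),
    (3, 310, 380, 70.0),
    (5, 580, 690, 110.0),
    (6, 219, 307, 88.0),
    (7, 400, 555, 155.0),
    (9, 950, 1150, 200.0),
    (10, 973, 1321, 215.0),
    (11, 220, 300, 80.0),
]

# Collect the rows where base_rent + service_charge differs from total_rent.
mismatched = [
    index
    for index, base_rent, total_rent, service_charge in rows
    if base_rent + service_charge != total_rent
]

print(f"{len(rows) - len(mismatched)} of {len(rows)} rows add up exactly")
print("rows that do not:", mismatched)
```

Eight of these ten rows add up exactly; rows 1 and 10 do not, so the identity should not be relied on for the full dataset without further checks.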